ON STANLEY'S RECIPROCITY THEOREM FOR RATIONAL CONES 



MATTHIAS BECK AND MIKE DEVELIN 

Abstract. We give a short, self-contained proof of Stanley's reciprocity theorem for a rational 
cone K. C R''. Namely, let (T)c(x) — X] me Acnzt* Then crK;(x) and (Tko(x) are rational func- 
tions which satisfy the identity (Tjic(1/x) = ( — l)'*crK;o (x). A corollary of Stanley's theorem is the 
Ehrhart-Macdonald reciprocity theorem for the lattice-point enumerator of rational polytopes. A 
distinguishing feature of our proof is that it uses neither the shelling of a polyhedron nor the concept 
of finite additive measures. The proof follows from elementary techniques in contour integration. 



1. Introduction 

Let /C be a rational cone, that is, of the form 

/C = |r G M'^ : Ar < o| 

for some integral m x d-matrix A. Thus /C is a real cone in defined by m inequalities. We 
assume that K, is d-dimensional and pointed: i.e., that /C does not contain a line. 

In his study of nonnegative integral solutions to linear systems, Stanley was led to consider the 
generating functions 

a^(x)= 

me/cnzd 

and its companion cr/c°(x) for the interior /C° of /C. Here x™ denotes the product 
The function cr/c (as well as (Jjqo) is a rational function in the components of x; this is proved, for 
example, by triangulating /C into simplicial cones, for which one can explicitly form the rational 
functions representing their generating functions (see, for example, Chapter 4]). The fundamen- 
tal reciprocity theorem of Stanley [71 relates the two rational functions 0"a; and cj^o . We abbreviate 
the vector (1/xi, l/x2, • • • , l/a;^) by 1/x. 

Theorem 1 (Stanley). As rational functions, cryc (x) = (— l)'^ cjyco (1/x). 



The proof of Stanley's theorem is not quite as simple as that of the rationality of fi/c- Most 
proofs of Theorem ^ that we are aware use either the concept of shelling of a polyhedron or the 
concept of finitely additive measures (also called valuations). Both concepts reduce the theorem to 
simplicial cones, for which Theorem ^ is not hard to prove. In this paper, we give a proof which 

Date: February 1, 2008. 

2000 Mathematics Subject Classification. 05A15, 52C07. 

Key words and phrases. Rational cone, generating function, Stanley reciprocity, lattice point, rational convex 
polytope, quasi-polynomial, Ehrhart theory. 

Thanks to Sinai Robins and Guoce Xin for helpful discussions. 

1 



2 



MATTHIAS BECK AND MIKE DEVELIN 



neither depends on shelUng nor on valuations. It is based on rational generating functions, which 
essentially go back to Euler. In [Jj Stanley used these generating functions to suggest a residue- 
computation approach to proving Theorem^ he then implemented it by computing each of the 
residues in gory detail. Here we show that one can, in fact, reduce Theorem^to a simple change of 
variables in the integral over the generating function, proving Stanley's reciprocity theorem without 
any actual computation of residues or other technical machinery. 



2. EuLER'S GENERATING FUNCTION 



Suppose we have an element (p of SL^{Z), i.e. a unimodular transformation preserving the integer 
lattice. We claim that the image 0(/C) will satisfy the reciprocity law if and only if IC does. Indeed, 
it is easy to see that the generating function ^"^(a;) is obtained by substituting x'^^^'^'^ for Xi (for all i) 
in the generating function cr^, and similarly the generating function (J^(^]<z°) is obtained from a]c° by 
the same substitutions. From this observation, we immediately obtain that if o"x;(x) = o"a:o(1/x), 
then 

cT^(^)(x) = (x<^(-«)) = (x-<^(->)) = a<^(;c°)(l/x), 
so o"0(AC) and cr^^^o-, also satisfy the reciprocity law. 

Now, given any pointed cone /C, we can find a unimodular transformation (j) so that (/'(/C) lies in the 
nonnegative orthant, and intersects its boundary only at (0,0, ... ,0): simply pick a lattice basis 
such that K, is contained in the interior of the cone it spans, and send this to the standard basis. 
(This is easy to do; for instance, take any hyperplane which intersects K, only at the origin, pick 
a lattice basis for the sublattice contained in it, and take the final point at lattice distance 1 in 
the direction of /C, and very far away in the direction opposite the d—1 points in the hyperplane.) 
Therefore, it suffices to consider the case where /C C M>o and (0, 0, ... , 0) is the only point in /C 
with any zero coordinate. We will assume that /C has this form from now on. 

Denote the columns of A by ai,...,arf. Our main tool for proving Theorem ^ is the following 
lemma, the idea of which goes back to at least Euler ^ and which is proved by simply expanding 
geometric series. 

Lemma 2. o"j<;(x) equals the constant z,- coefficient of the function 

1 

(1 - XlZ^l) (1 - X2Z^2) • • • (1 - XdZ^'*) (1 - Zl) (1 - Z2) • • • (1 - Zm) 

expanded as a power series centered at z = 0. 



We will use an integral version of this lemma, for which we need additional variables to avoid 
integrating over singularities. Let 

1 \ 



(1) Ojc{x,y) = Ok. (xi, ...,xd,yi,.. ■ ,ym) = const^ 

This is a rational function in the coordinates of x and y, and o"yc(x) = ^/^^(x, 1, 1, . . . , 1). The 
y- variables represent slack variables, one for each inequality; thus instead of Ax < and x > 0, 

we consider the cone given by (A [ /) f =0 and x,y > 0, where / is the n-by-n identity 
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matrix. We next translate this constant-term identity into an integral identity. We abbreviate the 
measure 
for [x[, |y[ < 1 



measure • • • %7 by and we write |x| < 1 to indicate that , \x2\ , • • • , 1x^1 < 1. Then 



^^(x, y) 



where the integral sign stands for l/(27ri)™ times an m-fold integral, each one over the unit circle. 
Note that although we are going to plug in y = (1, 1, . . . , 1), we may compute the rational function 
y) by evaluating it in any sufficiently large domain, such as the one used here. 



Analogously, we have for the open generating function 
a]<i° (x) = const 



1 — XiZ^^l 1 — X2Z'*2 1 _ XrfZ'^d 1 — Zi 1 — Z2 1 — 2„ 

and we define 

Id „ . m 



0/c° (x, y) = const, [] H T 



Then (T/(;°(x) = 9/c°{x, 1,1,...,!) and for |x|, |y| < 1 



/d J, , m , 

TT TT y^^fc 



• 1 - -J- ^ 1 - Vk^k Z 
j=l k=\ 

The integral representations of Q}q and Q}qo now suggest how to prove Theorem ^ — make a change 
of variables z — > 1/z, say in 0^o: 

Hence the rational function 0x:°(l/x, 1/y) has the integral representation 

dz 



TT — - — TT^— 

j=\ ^ k=l 

/d ^ m ^ 

TT — - — TT — - 



dz 



j=i - ^-J'' ^ fc=i ^ 2/^^^ ^ 
valid for |x|, |y| > 1. It remains to prove that the rational function given by the integral 

[ ^ 

with |x|, |y| < 1 equals the rational function given by the integral 

(3) (-ir / ^ — 
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with |x|, |y| > 1, since once we have this equahty, we have 6'^o(l/x, 1/y) = (— y); plugging 
in y = (1, 1, . . . , 1) (whereupon 1/y = (1, 1, . . . , 1) as well) shows that 0ac°(1/x) = Oic{x) as desired. 

Let us consider as the innermost integral, the one with respect to zi. Almost all of the poles of 
the integrand f{zi) are at the solutions zi of the equations 1 — xjz^^ = (for j = 1,. . . ,d) and 
1—yiZi = 0. Since |2;2| , l-Zal , . . . , \zm\ = 1, each zi-pole is inside or outside the unit circle, depending 
on the exponent of zi in z^^ and on whether a given Xj or yj has magnitude smaller or larger than 
1. But this means that the zi-integrals in Q and © pick up the residues of complementary 
singularities. 

The only other potential poles are at zero and infinity, induced by the extra factor of We claim 
that there are no residues at these poles, if they exist. Indeed, as zi approaches zero, the residue 
z^ f(z^) is the product of the factors -; — and — - — . Since K is in the nonnegative orthant, 

in each inequality at least one coefficient must be negative; therefore, in at least one of the former 
factors, the exponent of zi must be negative. This factor then goes to zero as zi does, while the 
norms of all of the other factors either go to zero (if the exponent of zi is negative), one (if the 
exponent is positive), or a constant (if zi does not appear at all.) Therefore, this residue is equal 
to zero. 

A similar argument eliminates the residue at infinity. After a change of variables zi ^ this 
residue at infinity is (up to sign) the limit of the product of these same factors. However, the 
factor — goes to zero, while all other factors again go to zero, one, or a constant, depending on 
whether the exponent of zi is positive, negative, or zero respectively. This completes the argument 
that the zi-integrals in and ^ differ by a minus sign. 

We can use the same argument for the next variable if we know that the zi-integral results in 
a rational function with a similar-looking denominator as the integrands in (j^J and ©. But 
this follows from Euler's generating function: The zi-integral in ^ gives the generating func- 
tion of the cone described by r > and the first row inequality in Ar < 0, in the variables 

{z2,..., Zmf''^ and y, where a'^, . . . , are the column vectors of A after 
we removed the first row. It is not hard to show (from the simplicial case) that the generating 
function of any such cone has as denominator a product of terms of the form 1 minus a monomial 
of the variables, so the 2;2-integrand has the desired form. 

There is a possible obstruction here, which is that these monomials may have x's and y's appearing 
both in the numerator and the denominator. If we pick a poor choice of norms of the x's and y's, 
we may not be able to evaluate these integrals, since the norm of this monomial may be 1, in which 
case there are poles on the unit circle. However, we can get around this simply by looking at an 
open set where none of these magnitudes are 1 to evaluate the integral for |x|, |y| > 1, and looking 
at the image of this set under x x~^,y y~^ to evaluate the integral (of the same rational 
function) for |x|, |y| < 1. 

We also need to show that the residues of the intermediate integrals at zero and infinity are both 
zero. As we evaluate the first integral, we obtain the sum of residues at poles, each of which has z\ 
equal to a monomial in the other variables, possibly with fractional powers. (If fractional powers 
bother you, simply replace each variable by a power of itself.) The corresponding residue is obtained 
by plugging this monomial in for z\ in all of the other factors, along with the remaining portion of 
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the factor which the pole was extracted out of. Each of these residues has denominator a product 
of terms of the form 1 minus a monomial of the variables. As we evaluate each integral, we perform 
further substitutions of monomials in the x's, y's, and later z's for each z-variable in succession. 

As Zi goes to zero or infinity, each factor in each residue goes to 1, 0, or a constant depending on 
whether the exponent of Zi is positive, negative, or zero, as in the first integral. We need to show 
that in each residue, one of the factors goes to 0. As goes to infinity, the factor -|_^ ^ , which 
is unblemished (since it has no other z- variables), goes to zero. The analysis when Zi goes to 
is a bit trickier. For each previous z-variable, representing an equality, we have picked one of the 
factors, corresponding to one x or y-variable. Substituting for that variable amounts to solving the 
given equality to express that variable in terms of the other variables, and plugging the expression 
into the other equalities to create a new system of equations. The residue will be given by the 
Euler-type generating function of this new system of equalities. 

With this formulation, it is clear that if we have a solution ( | , then the same vector with the 

deleted variable removed will be a solution of the new system of equalities; see Example yl Taking a 
vector with all positive entries and following it through the process of eliminating the first i — 1 rows 
by deleting the corresponding columns, we find that the system of equalities at the i-th step has 
an all-positive solution. From this, it follows that one of the entries in each row must be negative, 
and in particular there must be a negative number in the i-th row. The corresponding factor of 
the denominator will have Zi appearing with negative exponent, and thus will go to as Zi goes to 
0. So each summand has residue as zi goes to 0, and thus the entire i-th integrand does. 

Therefore, for each integral, we have complementary residues counted in integrals © and 
introducing a minus sign; after factoring this out, the two integrals produce identical rational 
functions which move on to the next integral. Since we have m integrals, we obtain that the 
integrals in and (jSJ differ by a factor of (—1)™ as desired. This completes the proof. 

It is worth noting that we made the decision to replace the i_J.ga^ factors in 0a:(x, y) by factors of 

i-x-z'^i ™ ^1^° (-^^ Since the facets correspond to the y- variables, we do not need strict inequalities 
on the x-coordinates in order to compute the generating function of IC°. Instead, we chose to use 
this expansion, since it yields the correct complementarity statement regarding the poles. 

3. An ILLUSTRATIVE EXAMPLE 

In this section, we give an example illustrating the proof of the previous section. 
Example 3. Let K he the pyramidal cone in the positive orthant given by the inequalities: 



Xl ■ 


- X2 


< 





Xl 


- X3 


< 





X2 - 


■ 2x1 


< 





X3 - 


■ 2x1 


< 






This is a cone with vertex (0,0,0) over a square in the xi = 1 plane with vertices {l,i,j) for 
i,j E {1,2}. 
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The matrix A is then 



/ 1 


_1 


\ 






1 

-2 



1 


-1 




; the 


modified matrix A 

















(I 


-1 


1 0\ 






1 





-10 10 






-2 


1 


1 






^-2 





1 1/ 



and the original integrand (to be integrated over zi, . . . , Z4) is: 



(1 



'){l-X2Z-^ z^){l-xzZ2 za){1 - yizi){l - y2Z2){l - yzzi){l - y^Zi) 



We iUustrate what happens when we compute a residue in an intermediate integral. Suppose we 
are in the process of computing the residue with respect to zi at the pole corresponding to the first 



factor. This amounts to substituting zi 



into all of the other factors. Consider the 



second factor, (l — X2Zi^z^). This becomes (l — X2 {xiZ2Z^'^z^'^) Z3); a moment's reflection will 
reveal that this is equivalent to adding an appropriate multiple of the first column of A' (here, 1) 
to the second column to cancel out the element in the first row. 



So, in effect, this residue is the Euler-type generating function of the matrix obtained by doing 
Gaussian elimination, adding a multiple of the first column (since we picked the xi term) to all 
other columns to eliminate the first row (since we are integrating over zi), and then deleting the 
first column. The only difference is that there are a few xi's thrown into each term, which are 
irrelevant for our purposes; recall that the point of this statement in the proof was merely to show 
that in this residue, some term exists with a negative power of each Zj. It suffices to show that this 
new matrix has a negative entry in each row. 

But, as noted in the proof, if we pick a positive solution to the original equation yl' [ ''^ ) =0, say 



ixi,X2,X3,yi,y2,y3,y4:) = (2,3,3, 1, 1, 1, 1), then if we remove xi , the new tuple (x2, xs, yi, 2/2, ys, 2/4) 
(3, 3, 1, 1, 1, 1) will be a solution of the new matrix, which in this case is 



1 -1-110 0' 
-1 2 1 
-2 1 2 1, 



That the elimination procedure has this property is easily verified; it comprises using the eliminated 
equation to solve for the eliminated variable in terms of the other variables, then plugging that into 
the other equations. 



Since there is an all-positive solution to the new set of equations, there must be at least one negative 
entry in each row. 



ON STANLEY'S RECIPROCITY THEOREM FOR RATIONAL CONES 



7 



4. Extensions and applications 



An interesting extension of Theorem ^ is the fohowing: Let 

/Ci = |r e M|o : A r < and B r < o| 

and 

/C2 = |r G : A r < and B r < o| , 

that is, ICi and IC2 are half-open cones whose constraints are defined by the same matrix, however, 
those facets (codimension-1 faces) which are contained in /Ci are not in IC2 and vice versa. Since 
by assumption (0, . . . , 0) is the only intersection point of the fuhy closed cone with the coordinate 
hyperplanes, at most one of /Ci and IC2 intersects these hyperplanes, so the domains are accurate. 

Theorem 4. As rational functions, if the set of facets closed in ICi is contractible, a/c^ (1/x) = 
(-l)^a^,(x). 

Stanley's original proof ^ showed that this was true whenever the set of facets consists of those 
visible from a certain point outside the cone; shelling-based proofs of Theorem^prove this theorem 
whenever the set of closed facets is contractible. 

Our proof technique is easily adjusted to this more general setting. 

Suppose A G U^^^ has columns ai, . . . , a^, and B G Z'^^" has columns bi, . . . , b^, then Lemma [21 
gives 



(j^^ (x) = constz 
and 

cTyca (x) = const 
The proof that 



1 1 Wl Wn 



z,w 



1 - XiZ^^iW^l) •••(!- XrfZ'^dwbd) [I - zi) ■ ■ ■{!- Zm) I - Wl l-Wn, 

XiZ^l XrfZ'^'^w'^'* Zi Z^ 1 

1 — XlZ^^W^^ 1 — XdZ^'i 1 — Zl 1 — Zm (1 — Wl) • • • (1 — Wn) 

1 TT 1 TT Urn+iWi dz dw 



/LL ^ lit lb 



and 

/d ah'™ ^ 1 77 
f-r XjZ-^^w"^ jj ykZk T-r 1 dzdw 
1 1 , — 11 ~^ 11 ' 
j = i k=l %=l 

both defined for |x|, |y| < 1, satisfy a Stanley-type reciprocity identity proceeds along the exact 
same lines as our proof of Theorem^ Note that as per the comment at the end of the previous 
section, even though the original cones tCi and /C2 are partially open, we can arbitrarily choose 
to "invert" all of the x-variables in IC2 and none in /Ci (since IC2 contains no points with any 
x-coordinate equal to zero.) 

However, this proof cannot work in all cases, since the generalized reciprocity theorem is not true 
in all cases. What can go wrong is that in the partially eliminated matrix, the term (or, rather, all 
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of the terms) with positive Wj-exponent can be those of the form i^^^^l^- , which does not in fact 
go to zero as Wi goes to infinity. It would be interesting to come up with an elegant charcterization 
of when this happens, which would provide an elegant proof of the generalized Stanley theorem for 
a subset of half-open cones. 

A particular nice application of Theorem ^ concerns the counting function Lp{t) := # {tV n Z^) 
for a rational convex polytope V, that is, the convex hull of finitely many points in Q"^. Ehrhart 
proved in jjlj the fundamental structural result about L-p(t), namely that it is a quasi-polynomial in 
t (for a definition and nice discussion of quasi-polynomials see [HI Chapter 4]). Ehrhart conjectured 
and partially proved the following reciprocity theorem, which was proved by Macdonald 

Theorem 5 (Ehrhart-Macdonald) . The quasi-polynomials L-p and L-po satisfy 

Lp(-t) = (-lf-^L^o(i) . 



5. Concluding remarks 

1. As already mentioned, most proofs of Theorems ^ and 13 use shellings of a polyhedron or finite 
additive measures (see, e.g., The only exceptions we are aware of are proofs via complex 
analysis (e.g., as above or in [7]) and commutative algebra (see, e.g., jSl Section 1.8]), as well as a 
recent proof [1] by the first author and Frank Sottile using irrational decomposition. 

2. It is a fun exercise to deduce Theorem El from Theorem for example by considering the 
generating function of the {d + l)-cone generated by (vi, 1) , . . . , (v„, 1), where vi, . . . , v„ are the 
vertices of "P, applying Stanley reciprocity, and then specializing the rational generating functions 
by setting the first d variables to 1. 

3. Finally, there exists an extension of Theorem [3 corresponding to Theorem |2 one includes some 
of the facets of the polytope on one side, and the complementary set of facets on the other side. 
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